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ABSTRACT 

Issues related to the estimation of individuals' 
vocabulary size are discussed, including the rationale for vocabulary 
size research and the psychological, pedagogical, and quantitative 
approaches to vocabulary research and methodological problems 
associated with them. Some results from a large-scale assessment o£ 
Finnish comprehensive school students' active and passive 
vocabularies, word-formation skills, and contextual inference 
abilities in English are outlined. Resulting vocabulary research 
directions are suggested in two major areas: test types and student 
populations. It is recommended that research on test types focus on 
how to tap partial knowledge of word meanings and their effect on 
vocabulary size estimates and on estimation of vocabulary in the 
context of discourse comprehension and production. It is also 
suggested that the student populations studied be extended to include 
lower stages of voca K Mlary development, end-of -secondary school and 
university students, students with more training in word analysis and 
context utilization, and students at different ability levels, in 
addition, theoretical inquiry on the nature of vocabulary learning, 
teaching, and research is recommended. (MSE) 
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ESTIMATING STlttNTS' VOCABLUttY SIZES IN FOREIGN LANGUAGE 
TEACHING 



Saul i TakaU 



1 Introduction 

In thla papar I will dlacuaa aoma 4aauaa related to 
tha aatlmatlon of paopla'a vocabulary oiioa and praaant 
aoma raaulta from ona larga-acala oaaoaamant atudy. I will 
Mrat outllno dlffaront approaohaa to vooabulary raaaarch 
and than fooua on tha mathodoSogical prob'.anw relatad to 
quantltatlva aatlmatlon of acquired vooobul or laa. 1 will 
concluda by citing amp 1 Heal roaulta obtalnad from ona 
atudy whora aoma now Maaa In taat theory wora oppllad to 
vocobulary learning. 



2 Dlffaront approochoa to vocobulory roaoorch 
2.1 Why atudy vocabulary? 

At tho outaot wo ohould oddraaa too boalc quoatlont 
Why ahould onyona bo Intaraatod In vocabulary raaaarch? 
Why ahould vocabulary knowledge bo on Intaraat Ing end 
Important oroa for roaoorch? In own, why bother about 
voeobulory? Thora oro aoma Indications thot Mnguiatlca 
(a.g., Bollnoer, 1W5$ lt70| lf?4| rillmoro, If79$ Hallo, 
Broo.ton A Ml I lor t 191* » Hal H day 19U| Melchuk * Zolkov- 
aky, If 74$ Raekln, IMS) lo a hawing o Or awing Intaraat In 

tho rolo of tho > ax I can a*d I* lexlael proeo n 

Important port of lln$uletlc thoory. #ayohologlata and 
payehollngulata novo domonatrato4 oloorly for qulto aoma 
tlmo ago thot vocabulary knowladgo la tho woat prodlctor 
of raadlng comprohonalon (••»,•• Andoraon & Froobody, 
ifll). According to aoma •etlmtoa (o.f.> Freebody a 
Andoroofi, 1M1| Frufflklna, Mil Mm a Oft, lf72| Klychnl- 
kove, 197S), about 70 ft of tho wot* I* « toxt ahould ba 
known for o global undereteftdlno. of iU mtonlog, about 
90 ft for undart landing oil molo lefeee. end ohaut 9* ft tor 
ondoratondlng olao dotolla. Ttede, wo ooa concluda that 
voeobulory knowlodgo lo definitely on Waeartant proroqule- 
Ita for dlacourao comprohonalon, and ooolng how central 
looming, from text lo In achool and out-of-achool , wo hava 
ampla raoaon to maintain that voeobulory roaoorch la an 
Important oroo for roaaoreh ond doaarvaa, If anything, to 
ba atrangthonod ond Intoitolf led* 
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2.2 Approaches to vocabulary research 

Vocabulary raaaarch can have a number of different 
epproecheo. In thia papar 1 will dtocute thraa auch 
approochaa. I will call tham paycho logical , pedagogical, 
and quant 1 1 at Iva, raapact I valy. 

If vocabulary raaaarch haa a pjlc£oUg[ce± blca, 
aavaral quaatlono arlaa aa pooalbla raaaarch problem. How 
la vocabulary proceeeed In comparison to e.g., perception, 
eynteic or whole dlecouree? Whet le meant by knowing e 
word? How doae memory work In looming vocabulery (en- 
coding, etorege and retrlevel) end how "en different 
technlquee (e.g., keyword method, hook moth* ' *elbly 
fecllltote vooebulery looming? Whet ccuaea dlf « end 

whet fecltltetee vocabulary learning? 

Ef vocabulary seeeereti hee e pedagogi cal blee, 
eeverol other quoatloja merit attention. ¥mat words ehould 
be learned (leeue of eelect Ion)? Whet ehould be the neture 
of looming out comet at different etegee of e coureet 
beginning, Intermediate, final etege (leeue of objoctlvoo/ 
goals concerning deelred vocabulary knowledge eno ekllle)? 
How ehould worde be eement Icliad, I.e., how ehould their 
meenlnge be taught? How ehould word meenlnge be coneolld- 
•ted? timet ehould be oho role of conecloue ve. Incidental 
vooebulery leetnlngt 

If vocabulary reoeorch hee e quant I tat I ve blee, ea It 
may hove due to I to neture - ooneletTng ee It doee of e 
lergo amount ef different worde - we may eek aomewhot 
different queetlone. Whet le the totel alia of vooebulery 
In e lenfuoeo? Hew meny dlf ferent worde do people know? 
Hew many worde do erdlnery people uee, end how many worde 
do wrttere uee? How doee vooebulery grow In childhood end 
In the leter etegee of life? Hew convnon ere different 
worde? 

In order to get enawera to ouch queetlone, eeverol 
methodological p robl ems hove to be eolved. Whet kind of 
TooTTypoe con~6e r oooi , "Fe teet different klnde of vooebul- 
ery knowledge (validity leeue)? How eon we get good 
oattaotea Of tetel vocebulery aliea on the beele of e 
eemple ef worde (leeue of reeeerch dealgn, end preblemi 
related to rel lebll I ty/deeondebl 1 1 ty end general deb 1 1 - 
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3 Estimation of ttudtntt* vocebulsry sliss 

3.1 Problem 

Tht main pur pott of ths study wot to sttlmete the 
•Izt of ttudtntt 1 sctlvs tnd passive vocabulary In English 
•fttr they hod ttudlsd English for seven yeers (sbout 600 
Isssons, tbout 4*0 clock hours). Tor t moro detolltd de- 
scription of tho research problem, in tht tuthor't 
doctortl dlssortstlon (Tekele Iff*). 

3.2 Design 

In thlt popor wo oro Intorottod in ootlmetlng tht 
ovtrtll tlzo of Engl I §n vocebulery loornod by ttudtntt In 
tht Flnnlth ci/nprehensl ve school . Thus wo ore dee ling with 
program ovoluotlon end dome In- referenced (or crlttrlon- 
roftronctd) moeturoment . Wo with to gonorollio Into the 
whole universe ol content (I.e., t ought vocebulery) tnd 
Into tho whole population of ttudonto. This noons thot It 
It ntcottory to tpeclfy tho content dome In tnd drew t 
rordom temple from It. ftj*ly thlo kind of design moktt tuch 
two-way gonortl Izotlon »*o*slbls. In ouch o design, It It 
ntofui or oven almost necosssry to opply mcltl-metrl* 
svnpling, which moons thot different t'odente onowor pert- 
ly or totolly different Items. Thue oovorol toot forms ere 
rendomly rotetod In cleoe. 

Pcsulotlon, Tho flnol irgtt fepulttlsn of ths study 
wss dofTnoo r ~oT~ ll oll F Inn I sh- spesk I no slodeote In ths finsl 
grodo of 'normoP ^omprehonsl *• school cloooos*. 

Studsnt Sampling. Prollmlnory studios (Tokole 1984) 
hod shown~fiot "It Is Import en t to fomplo o oufflclent 
number of tchcols, while It would no* bo nocosssry to 
somple meny students from ooch school. Tho templing method 
woe o two-stogt otrotlflod cluotor eemple. Tho primary 
sempilng unit wos tho school end tho eecoooory templing 
unit wos tho closs. Four stratn were ueod with ths slis of 
school end tho degree of itfrbenliot loo of tho school 
comnunlty oo tho two boooo of st rot I fleet loo. 

Tho designed temple of school cone ie ted of 42 schools 
ond tho executed scmpls of >• seheele. ^Itofethor, 2.415 
students took port In tho study. 

Item S empil ng. Vocobulory olio eotlMtlon promised to 
be o gooiTstert Tng point for goner el I iebl 1 1 1 y studlss. It 
Is 'sborlous but possible, duo to Flolond's fslrly 
csntrelliod school system, to define too dome in snd ovsn 
Jill end count ths Items In tho domain. 

Two textbooks, which wore prectlcolly tho only ones 
ussd In schools, wort rovlewed end words taught In them 
were llstsd soporstoly. Textbook 1 tough* ebout 2,>0Q 
words for tho two hlghsr sots (Sots A end B) ond ebout 
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1,500 words for the lowett eat (Sat C). Textbook 2 taught 
■bout 2,850 worda end 2,340 words, respect I vely. From the 
two eeperete llatt, ■ total of about 950 worde wan random- 
ly drawn and distributed among 40 dlffarant taat forma. 
Thua aach atudant had to ^aapond only to 40-50 Items. 

Cartaln design lotuoo war a taatad In tha atudy ao 
that Itamt wara distributed to althar "a robuat atudant 
aampla" and a "less robuat student eemple". Thay ara not 
raportad hara (aaa Takala 1984). 

3.3 Choice of taet type 

Several taet typee ware conaldered. Tha conotructod 
enewar technique, In which etudente wrote the English 
equlvetente of deconr sxtus I lied Tlnnleh worde ("ectlve 
vocabulary") end vice veree ("passive vocebulory" ) , wae 
choean on both theoretical and prectlcel grounda. Tor e 
more detelled deecrlptlon of the ratlonela for the choice 
of the teet type, aea Takele (1914). 

Sample I tome 

Instructions! "In thle teet you cen ehow how wall you 
know tha Engllah vocehulery Included In your couroe work. 
Below ere pree*ntod e number of Flnnleh worde. Your teek 
le to write the Engl I ah equivalent on the Una ebnve the 
Finnish word. Write the word even If you may not bo quite 
euro about the correct ape 1 ling, elnca rolling mletekee 
ara a minor coneldorat Ion In ecorlng." 




"Write tha flnnleh equivalent! of tha following Engllwh 
worde." 
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4 Da It collodion and data analyait 

Dais on atudant "ocabulary knowlodga, and on tha con* 
tayt off la aching and laarnlng, wara col lac tad In lha 
•prlng off 1979. Dala f 1 la) building look mora lhan a yatr* 

$tudanl anaworo wars tcortd 0-1 wllh moaning aqulval- 
anca aa lha ultimata criterion (a.g«, d 1 aragardlng cpall- 
Ing). Inlarralar agraamtnt was off tha ordar off 95 %. 

Data war a analyiad ualng a loolitlo Ham analytla 
program and vocabulary alia aallmalaa wnra ablalnad 
Ihrough a naw variant a component ■ analyala, which uaaa lha 
ganaralltad aymnatr Seal tumt (gaa) n*athoo3« ll waa ahown 
lhal lha raaultt obtalnad wllh • naw program ara Idanllcal 
with Ihoaa compulad wllh CronbaoVa ffarmulae from tha 3F5S 
Rallablllly Program aw an aquaraa Indict.. 



5 Soma main raaul it 

Yha main raaulia aff lha a>udy can at bflafly twwaar- 
liad aa follow. 

Thara waa no rallablo dlfftrtnct In lha atudonta* 
paaalva and acfclva vocabulary Ntowloalgo, aa l^oy wtrt 
maaaurad In lha atudy, Alan, atudantt* knawl aoga off almpln 
word- formal loci rulaa and tholr contextual lnfaranco 
ability wara poorly da va lop ad, In canwarltan ta tvplsol Ll 
tkllla. Tha following roooeno wara aoavnaajt (1J Flnnlah 
and Engllah ara nol relttad languagna, will ah may on* on- 
couraga auch akllla. (2) Tha nmphaalt at thla attgn la on 
aynlacllcal pattarna, while marpfcolngy la lor only nag t act* 
ad. O) Tha Iraalmanl of taxta la •Isitanalve*, glv'nc 
aludanla llllla axpoouro lo Engllah. Tho 00* tooted avara^e 
tlsa off vocabulary (aaa ttbla 1, erlalnal eatfcnatee) waa 
aboul 1,000 worda, wllh groal vorleblllt* In parffiwanca. 
Faal laarnara knaw about 1#M0 wer«a, average atwdenta 
aboul 909 and tlow laarnara about 450 war 4a. (too In tM< 
I (ml lad word-fforn*tle«t akllla, tho aal law! at oueht la ho 
adjuatad by up In 45 V by 17 *, ana! ay 7 * for lha • Ihroo 
aala, raapacilvaly (aaa labia 1, corrected oatunal f a). Thn 
ralallonahlp balwaan taught and laarnnd voeewelery HI 
55 *. >t «, and 20 ft for lha thraa aato, rttpaatt ve* y * 1 J 

Tnbla 1. Original and Correttad EalaMlna for the fete! U1 
Ptttlvt and Acilvo Vecebelerv Slsee, by net 



Sal 




Original 


aa| Imataa 


Corrnctnn aatftnwtet • 1 - 






Paaalva 


Aollvo 


Aall vn 


•ear Ivt /content* 1 
aided ^ * * 


Sat 
Sat 
Sal 


A 
B 

C 


1,550 
950 
450 


1,450 
•5fl 
550 


2,006 
1,025 
450 


2,20* 1 " ,fc v 
1,050 
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Verlence componente analysis snowed that words made a 
greeter difference in icoru than atudanta and that arror 
of moeeuremer.t can ba lowered mora efficiently by Increee- 
ing thi number of word Items than b> taking a largar 
studont aampla. There may alao ba an optimal sire of Input 
In vocabulary iteming. Students who used a textbook with 
a lowar Input learned less then thoee whose textbook 
taught more words. 



6 Implications end conciueione 

Now that e now approech to • lerge-scole eeeeesmtnt 
of vocabulary alza hee been developed, tested empirically 
end found to be e promising line of study, eeverei ra - 
eeerch quest lone suggest thomeelvee. Theee can ba divided 
into two major groups. One hee to do with the taat typos 
end the other with stuJent populetione. 

Ae wee mentioned In the ebove, it was poeeible to 
teet only limited eepecte of vocebulery knowledge* namely 
relatively eolld end easily accessible poseivo end active 
knowledge of worde. Severe! experiments ought to be con- 
ducted with other teet types thet top more partial know- 
ledge of word meeninga and see hew vocebulery site eatinv 
etas ere effected* 

Slmllorly, etudents* knowledge of vocabulary in the 
context of dieceurea comprehene 1 on end production ought to 
be estimated* Such experiments v*ould provide dete to com- 
plement the beet line dete collected In the preeant study. 
It would than ba poeeible to oitimete, with a certain 
degree of confidence, thai If students' decontg xtueli led 
end firm knowledge s,f L? worde Is X, their moru pertlei 
knowledge of vocebulery le X ♦ Y words- etc. It can be 
conjectured thet partial knowledge of e fair mount of 
basic words combined with eeme knowledge of beeic morphol- 
ogUe! rules Mid the avolleblllty of an edequete context 
can lead to op edequate comprehene I or of teet ooooges ond 
t# provide e geoel opportunity for more word learning, 

Tba study ought to be extended to other populetione. 
With regard te |bf present study, It would be Important to 
teet etudonto' knowledge of lower etege vocebulery el the 
and of t**et eceoel stage, Thle would make it poeeible to 
explein with greeter confidence the finding thet luw*r 
ataoe vocabulary wee known better than upper etege vocab- 
ulary. Ia tfele ae already at that ptage or le lower ettge 
vocebulery rapaated during the upper etege, end thue the 
difference In le**nlng Ja attributable to an Inareaee In 
the opportunity to leern lower etege vocebulery? Thie 
question could be studied in even greeter detell by look- 
ing et eech eucceeelve grade and comparing the reaulte, 
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Vocabulary fiats aeesesment ahould alao ba extended to 
oldar populations. How many word* do atudanta know at the 
and of the senior tacondary achool? How many worda do L2 
msjure at tha unlvaralty know? 

Othar etudiee ought to ou^rooo tha quaatlon of how 
otudentr* ability to uoa word analyala akllla davalopo 
over time ee tha atudy of L2 progrefcoee. T eechlng experl- 
mente ought to bo earrlad out in which atudanta of differ- 
ent ago lavele are taught word analyala and context util- 
ization akllla If* order to eea what effect euch direct 
teaching would hove on etudante' vocebuiery efficiency. 

Further, since It wee found that expooure to more 
worde had e fovoreble Influence on vocebulory looming, It 
ehould be etudled whet axpoeure leede to optimal word 
looming for etudante of varying ability* it leame likely 
thet the roletlonehlp it not Hneer but mora likely an In- 
verted U-ohoped curve. 

In terma of curriculer implicetione enc aducotlonel 
equality concerne, it would be importent to etudy when the 
obearvad ierge dlfferencee in vocabulary aita In L2 
emerge, and whether sett ing/itreaming (and uelng different 
textbooke with different Input) tends to increeee or do- 
creeee euch dl'fsrencee. I? limited input (I.e., smaller 
vocebuiery slie teught) better for elow learnere or ie 
thet a miegulded notion? 

in eddition to euch empirical reeeereh, It would be 
useful to devote earns attention to more theorotlool aueet- 
ions on the neture of vocebuiery looming, teething, end 
resssr .. Is it, for Inetcnce, in the very neture of e 
domain Ilka vocabuiery that tha Input ehoald ba Urss, and 
thet tha number of wards known eolldly would wo low or 
converaciy the number of words ainost forgotten wouM be 
high? whet would thet moan for toochlng, tasting and 
grodi.ig? Ie, for inetence, tha oboervod large Item 
virioneo component an iodlcotlon of the failure of toech- 
!ng, or Ie It e neturel charecter 1 et i c of L2 9 Ml for thet 
matter LI, learning and performance? 

It ie obvioue thet e whale reeeeroh oregren Ie needed 
to Increeee our knowledge about veeebuSery teechlng ond 
ieerning brtn in LI end L2. Cloeo link* between lift ond L2 
vocabulary raeeerch ere of greet Import an cs for optimal 
progrees. It may ba mnre loborieuo »a keep track af what 
Sa being done In b9kh LI end L2 rooooreh, but that it 
nacoeeery to avoid duplication af effort and to OtlMio 
the etete of ort knowledge. Thie it one of the main 
laeeone that work on this investigation ha a provided* It 
.a time to put that belief ?nto practice, now that tha 
data Invite further eleboretlon. Thie will ho o toward I ng 
experience, einca vocebuiery roeoorch te*n)a"*4jo *he*o a 
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